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Cloning vectors for expression fo exogenous protein. 
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The construction of a recombinant DNA cloning vector 
useful for expressing exogenous protein is described, which 
comprises 
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A DNA segment containing a functional origin of 
replication; 

one or more DNA segments, each of which conveys to 
a transformable host cell a property useful for selec- 
tion when said vector is transformed into said host 
cell: and 

a DNA segment comprising a sequence that defines, in 
tandem, 

the promoter off a lipoprotein expression control 
sequence, 

the 5' untranslated region of e lipoprotein expression 
control sequence and 

a translation start codon followed, without interposi- 
tion of a portion or all of a nucleotide sequence coding 
for endogenous protein, by a nucleotide sequence 
coding for an exogenous protein or by a nucleotide 



sequence coding for an enterokinase cleavage site to 
which is joined, without interruption, a nucleotide 
sequence coding for an exogenous protein. 



Croydon Printing Company Ltd 



10 



0095361 

X-5872 _x_ 



CLONING VECTORS FOR EXPRESSION OF EXOGENOUS PROTEIN 

This invention relates to novel DNA sequences 
and to cloning vectors (vehicles) useful in the pro- 
duction of protein products. 

Masayori Inouye and various of his co-workers 
have carried out extensive studies involving gene 
sequences coding for outer membrane proteins of gram- 
negative bacteria, in particular, the lipoprotein. 
These investigations have demonstrated that lipoproteins 
are present in relatively large quantities in bacterial 
cells. For example, there are approximately 7.2 x 10 5 
molecules of the lipoprotein of the Escherichia coli 
outer membrane per cell. Moreover, since it appears 
that there is only one structural gene for the lipo- 
protein in the E. coli chromosome, its transcription 
machinery must be highly efficient. 

Recent efforts of Inouye and associate's have 
been directed to expression of lipoprotein using appro- 
priately formulated plasmids in suitably transformed 
microorganisms and to determining and analyzing DNA 
sequences of various lipoprotein genes (Ipp) . Thus, in 
Nakamura and Inouye, Cell 18_, 1109-1117 (1979) , the DNA. 
sequence for the outer membrane lipoprotein of E.- coli 
is reported. An analysis of the promoter region of 
this sequence demonstrated some interesting features. 
First, it was noted that the segment of 261 base pairs 
(bp) preceding the transcription initiation site (-1 
to -261) has a very high AT content (70S) in contrast 
to 53% for the 322 bp mRNA region, 44% for the segment 
of 127 bp after the transcription termination site and 
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49% for the average AT content of the E. coli chromo- 
some. Secondly, it was noted that the first 45 bp 
upstream from the transcription initiation site (-1 
to -45) contained 36 bases (80%) which are A or T. 
5 Thirdly, a heptanucleotide sequence analogous to the 
"Pribnow box" is present eight bases from the tran- 
scription initiation site. Fourthly, a sequence 
analogous to the "FNA polymerase recognition site" is 
present on both strands between positions -27 and -39. 
10 Fifthly, a long dyad symmetry is centered at the 
transcription initiation site. 

It is postulated by Inouye and associates 
that these features either separately or in combination 
are responsible for the high degree of lpp promoter 
15 strength. In particular, it is postulated that the 
high AT content in the promoter sequence tends to 
destabilize the helix structure of the DNA and thereby 
facilitates strand unwinding that is essential for 
initiation of transcription. 

The Inouye group further has shown that a 
high degree of homology exists with respesct to lipo- 
protein gene sequences of other, perhaps all, gram- 
negative bacteria. Thus, an analysis of the DNA 
sequence of the Serratia marcescens lipoprotein gene 
25 and comparison with that of the E. coli lp£ gene shows 
a high degree of homology. iNakamura and Inouye, Proc. 
Natl . Acad . Sci. USA 77, 1369-1373 (1980)].' In partic- 
ular, they showed that the promoter region is highly 
conserved (84% homologyl , having an extremely high A 
30 and T content (78%) just as in E. coli (80%) . More- 
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over, the 5* untranslated region of the lipoprotein 
mRNA is also highly conserved (95% homology) . 

More recently, in Yamagata, Nakamura, and 
Inouye, J. Biol . Chem. 256 , 2194-2198 (1981), the 

5 DNA sequence of the lipoprotein gene of Erwinia 
amylovora was analyzed and compared with those of 
E. coli and S. marcescens . This study again confirms 
the high degree of homology existing in the lpp genes. 
Thus, the promoter region (-45 to -1) is highly con- 

10 served (87% relative to E. coli and 93% to S. marcescens) . 
An extremely high A and T content (80%) exists, just as 
in E. coli (80%) and S. marcescens (78%) . Moreover, 
the sequence of the untranslated region of the mRNA is 
highly .conserved (97% relative to E. coli and 92% to S. 

15 marcescens ) . ~~ 

The high level of constitutive transcription 
observed for the lipoprotein gene, based upon Inouye «s 
studies, recommends it as a vehicle for expression of 
exogenous DNA fragments. Moreover, the work of Inouye 

20 Si 2i- suggest that any of a wide range of lipoprotein 
genes of gram-negative bacteria may be so employed, 
including, for example, Escherichia coli , Shigella 
dysenteriae , Salmonella typhimurium , Citrobacter 
f reundii , Klebsiella aerogenes . Enterobacter aerogenes , 

25 Edwardsiella tarda , Erwinia amylovora , Serratia 
marcescens , and the like. 

Most recently, the suitability of the lipo- 
protein gene for product expression has been demonstrated' 
by Inouye et al. (C. Lee, Nakamura, and Inouye, J. 

30 Bacter. 146, 861-866 (1981) . ' in this work the sT 
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marcescens lipoprotein gene was cloned in a lambda 
phage vector and then recloned in plasmid vectors 
pBR322 and pSClOl. Both vectors carrying the S. 
marcescens lpp gene were used to transform E. coli 
5 cells. The evidence establishes normal expression, 

albeit at a level somewhat reduced relative to vectors 
containing the E. coli lpp gene. In any event , it has 
been established in the literature that vectors con- 
taining the lpp gene promoter and 5 1 untranslated 
10 regions can be employed to achieve significant levels 
of lipoprotein expression. 

By the term "vector" as used herein is meant 
fi plasmid, phage DNA, or other DNA sequence (1) that is 
able to" replicate in a host cell, (2) that is able to 
15 transform a host cell, and (3) that contains a marker 
suitable for use in identifying transformed cells. 

There are two embodiments of the specific 
class of cloning vectors to which this invention 
relates. Signif icantfly high levels of expression of 
20 exogenous protein can be achieved using either embodi- 
ment of the cloning vectors. In the first embodiment 
the cloning vectors are constructed to contain, in 
tandem, a nucleotide sequence defining the lipoprotein 
promoter region, a nucleotide sequence defining the 
25 lipoprotein 5 1 untranslated region, and a sequence 

coding for an exogenous protein product, the sequence 
coding for such product being connected via a translation 
start signal codon to the 3* terminal of the 5 1 un- 
translated region of the lipoprotein gene. In the 
30 second embodiment the sequence coding for the exogenous 
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protein product is connected via the aforementioned 
start codon and a nucleotide sequence coding for an 
enterokinase cleavage site to the 3' terminal of the 5» 
untranslated region of the lipoprotein gene. 

Thus, this invention relates to a recombinant 
DNA cloning vector useful for expressing exogenous 
protein, which comprises 

(a) a DNA segment containing a functional origin 
of replication; 

(b) one or more DNA segments, each of which 
conveys to a transformable host cell a 
property useful for selection when said 
vector is transformed into said host cell? 
and 

(c) a DNA segment comprising a sequence that 
defines, in tandem, 

(1) the promoter of a lipoprotein expression 
control sequence, 

(2) the 5 1 untranslated region of a lipo- 
protein expression control sequence 
and 

(3) a translation start codon followed, 
without interposition of a portion or 
all of a nucleotide sequence coding 
for endogenous protein, by a sequence 
coding for an exogenous protein or 

by a nucleotide sequence coding for an 
enterokinase cleavage site to which is 
joined, without interruption, a nucleo- 
tide sequence coding for an exogenous 
protein. 
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The recombinant DNA cloning vector is pre- 
pared by linking DNA segments (a) , (b) and (c) . As 
noted, this invention is directed to DNA sequences and . 
recombinant DNA cloning vectors that are highly efficient 
5 in producing exogenous protein. Each of these employs 

at least a portion of a lipoprotein gene ( lpp) machinery, 
and, preferably, a lipoprotein gene from gram-negative 
bacteria. By the term "exogenous protein" as used 
herein is meant a protein product other than the lipo- 
10 protein molecule normally expressed by the lipoprotein 
gene machinery or any portion of such molecule. 

Examples of typical gram-negative bacteria 
which may serve as a source of lpp machinery are, 
for example, Escherichia coli . Shigella dysenteriae , 
15 Salmonella typhimurium , Citrobacter f reundii , Klebsiella 
Aerogenes , Enterobacter aerogenes , Edwardsiella tarda , 
Erwinia amylovora , Serratia marcescens , and the like. 

The lpp gene can be described in terms of 
five elements. In the order in which they appear in 
20 the gene, these elements are as follows: (1) the 
promoter region; (2) the 5' untranslated region; 
(3) the lipoprotein coding sequence; (4) the 3" un- 
translated region; and (5} the transcription termination 
site. 

25 The function of each of these elements in 

gene systems is well recognized. The promoter region 
mediates initiation of messenger RNA (mRNA) production 
(transcriptionl . * The promoter may be free of external 
control (constitutive), under the control of a repressor, 

30 a substance that, when present, represses gene function, 
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or under the control of an inducer , a substance that is 
required to induce gene function. The Ipp gene is free 
from external control and thus is termed "constitutive". 
Located at or near the promoter is the 

5 "transcription initiation site", a point at which RNA 
polymerase binds to initiate transcription of mRNA. 
Once transcription is initiated, mRNA is produced. The 
structure of the resulting mRNA is determined by the 
DNA sequences of the gene elements (2) to (4) above. 

10 The resulting mRNA carries a sequence which 

is translatable into protein product. The translatable 
sequence is located downstream of the 5' untranslated 
region and upstream of the 3' untranslated region. " 
Translation is mediated by binding of ribosomes to a 

15 sequence in the mRNA 5' untranslated region denoted as 
the ribosome binding site and is initiated at the 
translation start codon (AUG) appearing as the first 
codon of the product gene sequence and coding as well 
for the amino acid methionine (Met) . Translation 

20 terminates at one or more termination codons appearing 
at the end of the translation region. 

By the techniques of recombinant DNA, it has 
become possible to prepare cloning vectors useful for 
the production of foreign (exogenous) proteins by 

25 inserting into such vectors an expression control 

sequence, i.e., a sequence of nucleotides that controls 
and regulates expression of structural genes when 
operatively linked to those genes. In the subject 
matter of this invention, the cloning vectors involve 

30 use of a portion or all of the lpp expression control 



0095361 



X-5872 -8- 

seguence, which includes elements (1) , (2), (4) r and 
(5) as aforedescribed. Of these four elements, in the 
cloning vectors of this invention, only elements (1) 
and (2), the promoter region and the 5' untranslated 

5 region are required. 

It has been customary, using recombinant DNA 
methodology, to produce a foreign protein by inserting 
a DNA sequence coding for such foreign protein into the 
expression control sequence of a cloning vector at a 

10 point such that the product expressed comprises a 

hybrid protein. By "hybrid protein" as used herein is 
meant a recombinant DNA product comprising all or a 
portion of the natural (endogenous) protein produced by 
the expression control sequence (in this case, lipo- 

15 protein) to which is attached the foreign (exogenous) 
protein. 

The properly designed hybrid protein will 
contain a cleavage site at the junction of the endo- 
genous protein portion and the exogenous protein. The 

20 cleavage site permits generation of mature exogenous 
protein product by chemical or enzymatic treatment of 
the hybrid protein product. 

As noted hereinbefore, it has been determined 
that the lpp expression control sequence is useful for 

25 expression of exogenous proteins. Most recently, 

however, it has been discovered that the lpp expression 
control sequence can be used to great advantage to 
express exogenous protein when the construction is 
designed such that a DNA segment comprising a sequence 

30 that defines, in tandem, the promoter and the 5* un- 
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translated region of a lipoprotein expression control 
sequence and a translation start codon followed, without 
interposition of a portion or all of a nucleotide 
sequence coding for endogenous protein, by a sequence 

5 coding for exogenous protein or by a nucleotide 

sequence coding for an enterokinase cleavage site tc 
which is joined, without interruption, a sequence 
coding for exogenous protein. This is in contradis- 
tinction to a hybrid protein comprising lipoprotein or 

10 a portion thereof and exogenous protein. 

In constructing the cloning vectors to which 
this invention relates, several elements are required. 
Two of the required elements are common to all useful 
cloning vectors. First, the vector must have a DNA 

15 segment containing a functional origin of replication 
(replicon) . Plasmids and phage DNA by their very 
nature contain replicons facilitating replication in a 
host cell. 

Secondly, the vector must have a DNA segment 
20 which conveys to a transformable host cell a property 
useful for selection of transformed cells from non- 
transformed cells. Any of a wide range of properties 
can be used for selection purposes. One of the most 
commonly used properties is antibiotic resistance, 
25 e.g., tetracycline resistance or ampicillin resistance. 

The foregoing two elements generally are 
present in readily available and recognized cloning 
vectors. Examples of suitable cloning vectors are 
bacterial plasmids, such as plasmids from E. coli , 
30 including pBR322, pMB89, ColEl, pCRl; wider host range 
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plasmids, including RP4; phage DNAs, such as lambda , 
and the like. Most, if not all, of the above-recognized 
vectors already carry the aforedescribed two elements* 
A third element, specific to the vectors to 
5 which this invention relates, is the lipoprotein ex- 
pression control sequence. The E. coli lipoprotein 
expression control sequence, present in plasmid pKENlll 
and cultured in E. coli CC620, has been deposited and 
made a part of the stock culture collection of the 
10 Northern Regional Research Center, Agricultural Research, 
North Central Region, 1815 North University Street, 
Peoria, Illinois, 61604, from which it is available to 
the public under the accession number NRRL 15011. The 
lipoprotein expression control sequence can be removed 
15 from pKENlll using recognized restriction sites and 

their corresponding restriction endonucleases. Any of 
a wide range of other lipoprotein expression, control 
sequences also are available using recognized meth- 
odology. Such methods may involve, for example, prep- 
20 aration by synthesis or by isolation of a probe using 
available Ipp sequences (e.g. pKENlll), and, taking 
advantage of the high degree of homology which exists 
between Ipp sequences, using such probe for selecting, 
by hybridization, Ipp sequences from other sources. 
25 In producing a suitable cloning vector by 

insertion of the lipoprotein expression control sequence, 
routine methods also are used. Various sites exist 
within cloning vectors at which cuts can be made using 
a restriction endonuclease specific for such site. Any 
30 of these sites can be selected for insertion of the 
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lipoprotein expression control sequence. As an example , 
in the well-recognized and documented plasmid pBR322, 
several suitable restriction sites exist, any of which 
may be employed as insertion sites. A PstI site is 

5 located within the gene for 3-lactamase. Other sites 
outside of any specific coding region are EcoRI and 
PvuII. These and other sites are well recognized by 
those skilled in the art. 

Taking advantage of any of these sites or 

10 others, insertion of a lipoprotein expression control 
sequence or the essential portion thereof can be 
readily accomplished in production of vectors defined 
herein . 

A fourth element, again specific to the 

15 vectors to which this invention relates, is the DNA 
sequence coding for the exogenous protein. The key 
requirement with respect to the exogenous protein DNA 
sequence in the vectors of this invention concerns its 
location. It must be located downstream of the 3' end 

20 of the 5' untranslated region of the lipoprotein ex- 
pression control sequence and in connection therewith 
via a translation start codon followed by a nucleotide 
sequence which codes for an enterokinase cleavage site. 
Necessarily, in the vectors of this invention, none of 

25 the DNA sequence coding for lipoprotein may be inter- 
posed between the 5* untranslated region and the sequence 
coding for exogenous protein. 

A fifth element, specific to the embodiment 
of the vectors to which this invention relates in which 

30 the translation codon is followed by a nucleotide 
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sequence coding for an enterokinase cleavag site to 
which is joined, without interruption, a sequence coding 
for exogenous protein. The amino acid sequence of the 
aforementioned cleavage site is recognized and cleaved 
5 at its'carboxyl terminal by the enzyme enterokinase. 
The nucleotide sequence coding for an enterokinase- 
cleavable amino acid sequence is joined at its 5' end 
to the translation start codon and at its 3' end to the 
5 f end of the nucleotide sequence coding for the 
10 exogenous protein and is designed such that the resulting 
translation product comprising (methionine) - (enterokinase 
cleavage site) - (exogenous protein) can, by treatment 
with enterokinase, be cleaved with production of mature 
exogenous protein. 
15 Enterokinase (3.4.21.9) has been described as 

"one of many hydrolases located in the brush border 
membrane of the intestinal duodenum." (J.J. Liepnieks 
and A. Light, J. Biol . Chem . 254 , 1677-1683 (1979)). 
Its isolation and purification has been described in 
20 numerous publications, see, for example, Liepnieks, 
supra ? S. Maroux, J. Baratti, and P. besnuelle, J. 
Biol . Chem . 246 , 5031-5039 (1971), and J. Baratti, S. 
Maroux, D. Louvard, and P. Desnuelle, Biochimica et 
Biophysica Acta 315 , 147-161 (1973) . 
25 Enterokinase appears to cleave a peptide at 

the carboxyl of a lysine (Lys) residue that is preceded 
by a multiplicity of acidic amino acids, i.e., glutamic 
acid (Glul and/or aspartic acid (Asp) . Thus, in A. 
Light, H.S. Savithri, and J.J. Liepnieks, Anal . Biochem. 
30 106 , 199-206 (1980), a number of amino acid sequences 
recognized by enterokinase are described, including 
many of the following: 
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10 



15 



20 



25 



30 



Phe-Pro~Leu- Asp-Asp-Asp- Asp-Lys ; 
Val-Asp-Asp-Asp-Asp-Ly s ; 
Phe-Pro-Ile-Glu-Glu-Asp-Lys ; 
Leu-Pro-Leu-Glu-Asp-Asp-Lys ; 
Ala-Asp-Asp-Lys; 
Asp-Asp-Asp-Asp-Ly s ; 

and the like. 

The nucleotide sequences coding for any of 
the above as well as others can be present in the 
cloning vectors to which this invention relates. The 
only requirement is that the nucleotide sequence be one 
which codes for an amino acid sequence that is rec- 
ognized by and, when present in a longer chain peptide, 
cleaved at its carboxyl terminal by enter okinase. 

In construction of vectors meeting these 
requirements, advantage can be made of a unique Xbal 
restriction site that appears in the 5' untranslated 
region of the E. coli lipoprotein expression control 
sequence. A cut can be made at the Xbal site with 
removal of a portion of the 5' untranslated region. 
Using recognized oligonucleotide synthesis methodology, 
a linker can be prepared comprising the removed portion 
of the 5' untranslated region to which is coupled a DNA 
sequence coding for a start codon or the start codon 
followed by the enterokinase cleavage site, and a 
portion or all of the exogenous protein. 

The DNA sequence coding for exogenous protein 
can be constructed synthetically, e.g., using the 
recognized phosphotriester method or other well- 
recognized methods, or its DNA sequence can be obtained 
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by recognized methodology as a copy from isolated mRNA. 
Once so obtained, the cDNA copy can be cut at a re- 
striction site located at a point as near the start 
codon as is available. In the first embodiment of 

5 the cloning vectors to which this invention relates, 
a linker composed of the lipoprotein 5 f untranslated 
region fragment removed by the Xbal cleavage followed 
by the cleaved port ion , including start codon , of the 
exogenous protein, can be prepared synthetically. In 

10 the second embodiment of the cloning vectors, a linker 
composed of the lipoprotein 5' untranslated region 
fragment removed by the Xbal cleavage followed by start 
codon, enterokinase cleavage site, and the cleaved 
portion' of the exogenous protein, can thus be prepared 

15 synthetically. These linkers, sufficient to bridge the 
gap, then are used in conjunction with remaining avail- 
able elements of the lipoprotein expression control 
sequence to prepare a vector. 

The cloning vectors to which this invention 

20 relates can be used to produce any of a wide range of 
exogenous proteins, including mammalian and human 
hormones, enzymes, and immunogenic proteins (or inter- 
mediates therefor). Examples of such products are 
insulin A chain, insulin B chain, proinsulin, inter- 

25 ; feron, growth hormone, antigenic proteins for foot and 
mouth disease, somatostatin, p-endorphin, and the like. 
Preferred cloning vectors are those designed for the 
production of human growth hormone or bovine growth 
hormone. It will be recgonized that the expression 

30 product of the first embodiment of the cloning vectors 
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will contain a methionine (Met) at their amino terminal 
by reason of the presence of the start codon. Expression 
product of vectors of the second embodiment will comprise 
methionine (start codon) , enterokinase cleavage site, 

5 and exogenous protein. Mature exogenous protein is 
generated by treating the latter expression product 
with enterokinase in accordance with recognized meth- 
odology (see, for example. Light et al., supra ) . 

The cloning vectors to which this invention 

10 relates can be used in a wide range of host organisms, 
for example, gram-negative prokaryotic organisms such 
as Escherichia coli , Serratia , Pseudomonas , and the 
like; gram-positive prokaryotic organisms, such as 
Bacillus , Streptomyces , and the like; and eukaryotic 

15 organisms, such as Saccharomyces , and the like. Pref- 
erably, the host organism is a gram-negative prokaryotic 
organism. Of gram-negative prokaryotic organisms, E. 
coli is especially preferred, for example, E. coli 
K-12 strains, such as RV308. 

20 Employing well recognized methodology, the 

cloning vectors are used to transform suitable host 
organisms, are amplified in such organisms, and exo- 
genous protein product is expressed using standard 
fermentation conditions. The exogenous protein product 

25 is isolated by routine methods from the resulting 
fermentation broth. 

The structure and function of cloning vectors 
to which this invention relates is illustrated by the 
examples which follow, which examples are to be read 

30 and understood in conjunction with the accompanying 
drawings in which: 
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Figures 1-4 together comprise a schematic 
illustration of the preparation of intermediates and 
starting material useful in the construction of both 
embodiments of the cloning vectors to which this 

5 invention relates. 

Figure 5 taken in conjunction with Figures 
1-4 comprises a schematic illustration of a method 
as described in Example 1 following for constructing 
a cloning vector useful for the production of methionyl 

10 human growth hormone. 

Figures 6-8 together and in conjunction with 
Figures 1-5 comprise a schematic illustration of a 
method as described in Example 2 for constructing a 
cloning vector useful for the production of methionyl 

15 bovine growth hormone. 

Figure 9, in conjunction with Figures 1-5, 
comprises a schematic illustration of a method as 
described in Example 3 for constructing a cloning 
vector which is a variant of the cloning vector 

20 described in Example 1 and which is useful for the 
production of methionyl human growth hormone. 

Figure 10 taken in conjunction with Figures 
1-4 comprises a schematic illustration of a method as 
described in Example 4 for constructing a second em- 

25 bodiment of the cloning vectors to which this invention 
relates useful for the production of human growth 
hormone • 

Figures 11-13 together and in conjunction 
with Figure 10 and Figures 1-4 comprise a schematic 
30 illustration of a method as described in Example 5 
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for constructing a cloning vector of the second embodi- 
ment for the production of bovine growth hormone. 

Preparation — Intermediates and Starting 
Material Common to the Construction of the First and 
5 Second Embodiments of Plasmids to which the Invention 
Relates — 

The ^5.1kb (kilobase) fragment produced by 
Xbal (5 , TCTAGA3 I ) , BamHI (5 , GGATCC3 t ) cleavage of 
10 plasmid vector pKEN021 (106 in Figure 3) was used as 
starting material. pKEN021 is a derivative of pKENlll 
(101 in Figure 1) (Lee, N., et al. , J. Bact. 146 , 
861-866 (1981) and Zwiebel, L. J., et al M J* Bact . - 
145 , 654-656 (19811, which is on deposit in E. coli 
CC620 (NRRL Deposit No. 15011) . Plasmid pKENlll has a 
2.8kb fragment which contains the lipoprotein gene of 
E. coli . A description of this fragment is provided in 
Nakamura, K. and Inouye, M., Cell 18 , 1109-1117 (1979). 
In pKEN021 the 650 bp (base pair) sequence between the 
2Q unique EcoRI (5 'GAATTC3 1 ) and Sail (5'GTCGAC3') restric- 
tion sites of pBR322 has been replaced by sequences 
taken from the lipoprotein gene of E. coli . The 
nucleotide sequence of all functional parts of this 
gene has been determined. The lipoprotein gene sequence 
25 (Nakamura, K. and Inouye, M. , Cell 18, 1109-1117 (1979)) 
includes a 462 bp Alul (5 , AGCT3') fragment upstream of 
the first codon (methionine) of the lipoprotein gene. 
This fragment contains the promoter, the 5' untrans- 
lated region and the ribosome binding site. A unique 
30 XbaI < 5,TCTAGA3, 1 restriction site is located within 
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the ribosome binding site 16 bp before the translation 
initiating methionine codon. A PvuII (S'CAGCTGS 1 ) re- 
striction site located 105 bp upstream of the trans- 
lation termination codon of the structural gene was 

5 changed to a BaroHI (5'GGATCC3 1 ) restriction site 

by the addition of a synthetic DNA adapter fragment, 
(5 1 CCGGATCCGG3 1 , obtained from Collaborative Research). 
The coding sequence for the last thirty-five amino 
acids of lipoprotein, the translation termination 

10 codon, and the sequence corresponding to the 3* un- 
translated region of the messenger RNA follow the BamHI 
site. Plasmid pKEN021 also includes some 850 bp of 
extraneous sequences unrelated to the lipoprotein gene 
and located downstream of it in the E. coli chromosome. 

15 These sequences were included as a consequence of the 
methods and restriction enzyme sites used in the 
original isolation of the gene. 

Referring to Figures 1, 2, and 3, plasmid 
PKEN021 is derived from pKENlll in the following 

20 manner: Fifty micrograms of pKENlll (101 in Figure 1) 
are digested with 25 units of restriction enzyme Hpall 
(S'CCGGS'l in 300 yl of a buffer containing 20mM 
Tris:HCl pH 7.4, lOmM MgCl 2 # and 6mM fJ-mercaptoethanol 
at 37 °C. for 2 hours. The mixture is extracted twice 

25 with 300 yl of a 50:50 mixture of phenol and chloroform, 
and the recovered aqueous phase is precipitated with 
2.5 volumes of ethanol. The DNA pellet is dissolved 
in 100 yl of electrophoresis buffer and fractionated 
on a 5 percent polyacrylamide gel ( aery lamide : bis ratio 

30 is 29:1 in all gels except where noted). The gel is 
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stained in a solution containing 0.5 yg/ml of ethidium 
bromide and bands are visualized under long wave-length 
ultraviolet light. A 950 bp band is isolated and 
recovered from the gel by electroelution into a dialysis 
5 bag. After phenol/CHCl 3 extraction and ethanol pre- 
cipitation the recovered DNA (approximately 2.5 yg) is 
dissolved in 25 yl of TEN (lOmM NaCl, lOmM Tris:HCl 
pH 7.4 and ImM sodium ethylenedinitrilo tetraacetate 
(EDTA) pH 8.0). 

10 Two micrograms of the 950 bp Hpall fragment 

are digested with restriction enzyme Alul (5 f AGCT3 9 ) in 
200 yl of a buffer containing 50mM NaCl, 6mM Tris:HCl 
(pH 7.6), 6mM MgCl 2 , and 6mM 3-mercaptoethanol for 2 
hours at 37 °C. The DNA is fractionated on a 6 percent 

15 polyacrylamide gel, and the 462 bp Alul fragment gen- 
erated is recovered and purified by the method here- 
inbefore described. The 462 bp Alul fragment (approxi- 
mately 1 yg) is dissolved in 10 yl of T 4 DNA ligase 
buffer (66mM Tris:HCl pH 7.6 # lOmM MgCl 2 , lOmM di- 

20 thiothreitol, 0.4mM ATP) containing 150 picamoles of 
phosphorylated EcoRI linker ( 5 • GGAATTCC3 • from Col- 
laborative Research) and 2 units Tj DNA ligase. After 
incubation at 4*C. for 16 hours the mixture is heated 
at 65 °C. for 10 minutes and diluted to 100 yl with the 

25 addition of EcoRI buffer (lOOmM Tris:HCl pH 7.2, 50mM 
NaCl, lOmM MgCl 2 , 6mM 3-mercaptoethanol) and 40 units 
EcoRI enzyme. After 2 hours at 37 °C. the sample is 
phenol/CHCl 3 extracted and ethanol precipitated by the 
method hereinbefore described. The DNA is dissolved in 

30 20 yl of T 4 DNA ligase buffer containing 0.1 unit T A 



X-5872 



-20- 



0095361 



DNA ligas and 0.1 yg pBR322 (102 in Figure 1) which 
has been linearized with EcoRI and alkaline phosphatase 
treated to remove end phosphates. After ligation at 
4°C. for 16 hours the material is used to transform a 
5 suitable E. coli strain (hsr~, hsm + ) such as HB101. 
The bacterial cells are made competent for transfor- 
mation using a standard CaCl 2 treatment. Transformants 
are selected on agar plates containing 12 yg/ml of 
tetracycline. Plasmids "are isolated from several 
10 . tetracycline resistant colonies by the rapid alkaline 
: extraction procedure described in Birnboim, H.C. and 
Doly, J., Nucleic Acids Research 7, 1513-1523 (1979). 
A plasmid (103 in Figure 1) containing a 466 bp Xbal, 
BamHI fragment (desired orientation) is selected and 
15 used as the starting material for the next step. 

Two micrograms of this plasmid (103 in Figure 2) 
(having one Hindlll (S'AAGCTTS') restriction site) are 
digested with 2 units of Hindlll enzyme in 50 yl 
Hindlll buffer (60mM NaCl, lOmM Tris:HCl pH 7.4, lOmM 
20 **9 C1 2 and 6mM P~ merca P toet kanol) for * hour at 37 °C. 

After phenol/CHCl 3 extraction and ethanol precipitation 
the DNA is dissolved in 200 yl of a buffer containing 
300mM NaCl, 30mM sodium acetate pH 4.25, ImM ZnCl 2 and 
200 units of SI nuclease (Miles Laboratories) which is 
25 specific for single stranded DNA. After 1 hour at 

15°C. the reaction is stopped by phenol/CHCl 3 extraction 
and ethanol precipitation. The plasmid, which has now 
had the single stranded, Hindlll-generated ends re- 
moved, is dissolved in 10 yl Tj DNA ligas e buffer 
30 containing 20 picamoles phosphorylated BamHI linkers 
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10 



15 



20 



25 



30 



( 5 • CCGGATCCGG3 ■ , from Collaborative Research) and 2 
units T 4 DNA ligase. After 16 hours at 4°C. the reac- 
tion mixture is heated at 65 °C. for 10 minutes to 
inactivate the ligase. The mixture is diluted to 
100 pi in BamHI buffer (150mM NaCl, 20mM TristHCl pH 
8.0, lOmM MgCl 2 , 6mM 3-mercaptoethanol) containing 20 
units BamHI enzyme. After 2 hours at 37 °C. the mixture 
is purified on a 1 percent agarose gel. The gel is 
stained and the larger fragment (4.5kb) is recovered by 
elution after freezing and purified by phenol/CHClj 
extraction and ethanol precipitation. The recovered 
plasmid with BamHI cohesive ends is dissolved in 20 yl 
of T 4 DNA ligase buffer containing 0.1 unit Tj DNA 
ligase. After 16 hours at 4°C. the DNA is used to 
transform E. coli HB101. Transf ormants are selected by 
resistance to ampicillin (Ap r ) at 100 pg/ml and screened 
for sensitivity to 10 ug/ml tetracycline (Tc S )-. Sev- 
eral plasmids are prepared by the previously described 
Bimbo iro procedure from colonies which are Ap r Tc s . 
These are examined for the absence of a Hindlll site 
and the presence of a single BamHI site. EcoRI, Sail 
sequential digestion yields a 466 bp and a 305 bp 
fragment. A plasmid (104 in Figure 2) with thejse 
characteristics is selected and is modified to remove 
the EcoRI site located upstream of the lpp promoter and 
to convert it to a Hindlll restriction site. 

Two micrograms of plasmid (104 in Figure 2) 
are digested in 100 pi of EcoRI buffer with 0.2 units 
of EcoRI for 10 minutes at 37 °C. The reaction is 
stopped by heating for 10 minutes at 65°C. After 
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phenol/CHCl 3 extraction the DNA is ethanol precipitated 
and dissolved in 200 yl of SI nuclease buffer contain- 
ing SI nuclease at 1000 units/ml. After 1 hour at 
12 °C. the reaction is stopped by phenol/CHCl 3 extrac- 

5 tion and ethanol precipitation* The DNA is resuspended 

in 10 yl of T 4 DNA ligase buffer containing 20 pica- 
moles phosphorylated Hindlll linker (5 1 CCAAGCTTGG3 1 , 
from Collaborative Research) and 2 units of T 4 DNA 
ligase. After 16 hours at 4°C. the ligase is inacti- 

10 vated by heating 10 minutes at 65 °C. The reaction 

mixture is diluted to 150 yl in Hindlll buffer contain- 
ing 10 units Hindlll enzyme. After incubation for 2 
hours at 37°C, the mixture is fractionated on a 1 
percent agarose gel. After staining in ethidium bro- 

15 mide, the largest band (equivalent to single cut plas- 
mid) is recovered and purified. The plasmid is dis- 
solved in 20 yl T 4 ligase buffer containing 0.2 units 
T 4 ligase, incubated 16 hours at 4°C. and used to 
transform E. coli HB101. Transformants are selected 

20 for ampicillin resistance and are screened by the 

Birnboim procedure. Plasmid isolates are analyzed by 
restriction with EcoRI (1 site) and Hindlll (1 site) 
enzymes. A plasmid (105 in Figure 2) with an EcoRI, 
Hindlll fragment of 500 bp is selected and used as the 

25 cloning vector for addition of the 3 f region of the lpp 
gene. 

Two micrograms of plasmid (105 in Figure 3) 

are digested in 50 yl of Sail restriction buffer (150mM 

NaCl, 6mM TristHCl pH 7.9, 6mM MgCl 9 , 6mM p-mercapto- 
30 * 
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ethanol) with 2 units of Sail for 1 hour at 37°C. The 
reaction is diluted to 150 ul in BamHI buffer con- 
taining 2 units BamHI. After 1 hour at 37 °C, 2.5 
units of alkaline phosphatase are added and incubation 
continued for 1 hour at 65 °C. The material is phenol/- 
CHC1 3 extracted, ethanol precipitated, dissolved in 
TEN, and used as cloning vector for the lpp 3* fragment. 

To obtain the fragment containing the lpp 3' 
region, 10 ug of pKENlll (101 in Figure 3) are digested 
in 200 pi of Hpal buffer (20mM KC1, lOmM Tris:HCl pH 
7.4, lOmM MgCl 2 and 6mM 0-mercaptoethanol) with 10 
units of Hpal ( 5 * GTTAAC3 1 ) for 2 hours at 37°C. After 
phenol/CHCl 3 extraction and ethanol precipitation, the 
DNA is dissolved in 10 ul T 4 DNA ligase buffer contain- 
ing 20 picamoles phosphorylated Sail linker 
(5'GGTCGACC3' , from Collaborative Research) and 2 units 
T 4 DNA ligase. After 16 hours at 4°C. the ligase is 
inactivated by heating at 65°C. for 10 minutes* The 
material is diluted to 100 yl in Sail buffer containing 
10 units of Sail and incubated 1 hour at 37 °C. The DNA 
is diluted to 300 ul in PvuII buffer (60mM NaCl, 6mM 
TristHCl, P H 7.5, 6mM MgCl 2 , 6mM B-mercaptoethanol) 
containing 10 units PvuII restriction enzyme. After 1 
hour at 37 °C. the DNA is fractionated on a 5 percent 
polyacrylamide gel. Approximately 0.5 ug of a 950 bp 
fragment is recovered, purified and dissolved in TEN. ' 
Two-tenths microgram of fragment is diluted into 20 yl 
T 4 DNA ligase buffer containing 20 picamoles phos- 
phorylated BamHI linker (S'CCGGATCCGGS' , from Col- 
laborative Research) and 2 units T 4 DNA ligase. After 
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16 hours at 4°C the ligase is inactivated by heating 
10 minutes at 65°C. The DNA is diluted to 100 yl in 
BamHI buffer containing 20 units BamHI. After 2 hours 
at 37 °C. the DNA is fractionated on a 5 percent poly- 

5 acrylainide gel to remove excess linker molecules* The 

950 bp fragment having BamHI and Sail cohesive ends is 
recovered and purified* The fragment is dissolved in 
20 pi of T 4 DNA ligase buffer containing 0.2 yg of 
cloning vector described previously and 0.2 units T 4 

10 DNA ligase. After incubation for 16 hours at 4°C. the 
material is used to transform E. coli HB101. Plasmids 
are prepared from ampicillin resistant transformants 
and analyzed for a Sail, BamHI fragment of 950 bp. The 
desired plasmid (5.2kb) is designated pKEN021 (106 in 

15 Figure 3) . 

Ten micrograms of pKEN021 were digested in 
200 yl of Xbal/BamHI buffer (150mM NaCl, lOmM Tris:HCl 
pH 8, lOmM MgCl 2 , 6mM 0-roercaptoethanol) using 10 units 
of BamHI for 1 hour at 37 °C. followed by 10 units of 

20 xbal for 1 hour at 37 °C. The DNA was then treated with 
2.5 units of alkaline phosphatase for 1.5 hours at 
65°C. , phenol/CHCl 3 extracted, collected by ethanol 
precipitation, and dissolved in 50 yl of TEN (lOmM 
Tris:HCl pH 7.4, lOmM NaCl, ImM EDTA) for 0.2 yg/yl. 

25 This preparation (107 in Figure 3) was used as the 
plasmid cloning vector. 

Plasmid ptrpED50chGH800 (108 in* Figure 4), 
described in Martial, J. H. , et al.. Science 205 , 
602-607 (1979) , was used as the source of a DNA frag- 

30 ment containing the coding sequence for a portion of 
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the human growth hormone gene. This fragment also is 
available using recognized methodology for isolating 
mRNA coding for human growth hormone from human pitui- 
taries. Such methodology is described by Goodman, H. 
M. , et al., Methods in Enzymology 68 , 75-90 (1979). 
The human growth hormone gene portion of plasmid 
ptrpED50chGH800 contains a unique Smal (5'CCCGGG3') 
restriction site 6 bp downstream from the translation 
termination codon of the gene. This site was changed 
to a BamHI site using the following procedure: 6 pg of 
the plasmid were digested with 6 units of Smal in 
200 pi of Smal restriction buffer (15mM Tris:HCl pH 
8.0, 6mM MgCl 2 , 15mM KCl and 6mM 3-mercaptoethanol) for 
1.5 hours at 37 °C. After digestion was complete, 
phenol/CHCl 3 extraction was performed, and the DNA was 
recovered by ethanol precipitation. The precipitated 
DNA was dissolved in 24 pi of TEN. Forty picamoles of 
phosphorylated BamHI adapter fragment (Collaborative 
Research) were added to 0.5 pg (0.2 picamole ends) of 
the above digested plasmid in 16 pi of ligase buffer 
containing 2 units T 4 DNA ligase. Ligation was allowed 
to occur 2 hours at 22 °C. and 16 hours at 4°C. T 4 DNA 
ligase was inactivated at 65°C. for 10 minutes. BamHI 
cohesive termini were generated by dilution into BamHI 
buffer containing 20 units BamHI enzyme in a final 
total volume of 40 pi followed by incubation at 37 °C. 
for 1 hour. The enzyme cleaved the linker sequence as 
well as a BamHI site located at the beginning of the 
cloned cDNA sequence of human growth hormone. This 
yielded a 691 bp fragment with cohesive BamHI ends 
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which was separated on a 6 percent polyacrylamide gel 
and visualized under long wavelength ultraviolet light 
after staining in an ethidium bromide solution at 
1 ug/ml. The gel region containing the fragment was 
5 excised and the DNA fragment was recovered by electro- 
elution into a dialysis bag followed by ethanol pre- 
cipitation* The precipitated DNA was recovered by 
centrifugation, dissolved in TEN, phenol/CHCl 3 ex- 
tracted to remove ethidium bromide and ethanol pre- 
10 cipitated. The recovered DNA fragment was ligated 

(using 0.2 unit T 4 DNA ligase in 20 pi of buffer under 
previously described conditions) with 0.2 yg pBR322 
(102 in Figure 4) which had been cleaved at its unique 
BamHI site and treated with alkaline phosphatase. 
15 After 16 hours at 4°C. the material was used to trans- 
form E- coli strain JA221 (recA~, hrs~hsm* , AtrpES , 
thr, leu , thi , lacY~) which is on deposit as NRRL 
Deposit No. 15014. A transformation procedure as 
described by Wensink, P. C. et al • , Cell 3, 315-325 
20 (1974) was used, and transformed colonies were selected 
on agar plates containing 100 yg/ml ampicillin. 
Plasmid DNAs were isolated from sixteen of the ampi- 
cillin resistant colonies by the rapid alkaline- 
denaturation method previously described by Birnboim 
25 and then analyzed by restriction enzyme digestion and 
gel electrophoresis. Eleven of the sixteen plasmids 
examined were found to contain a BamHI fragment of 
approximately 700 bp. One of these plasmids pNM575 
(109 in Figure 4) was chosen for amplification to use 
30 as a source of DNA fragment for the plasmid construe- 
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tion to be described. The DNA sequence of mature human 
growth hormone contains one FnuDII (S'CGCGS 1 ) site 
which is 47 bp from the first nucleotide. There are 23 
recognition sites for this enzyme in pBR322. Twenty- 
five micrograms of pNM575 were digested in 250 yl of 
BamHI buffer with 25 units of BamHI at 37 °C. for 1 
hour. The 691 bp fragment with BamHI cohesive termini 
was isolated from a 6 percent polyacrylamide gel and 
purified by procedures described above. After purifi- 
cation of the fragment one third of it (equivalent to 
8 iig of plasmid) was digested in 100 yl of FnuDII 
buffer (6mM NaCl r 6mM Tris:HCl pH 7.4, 6mM MgCl 2# 6mM 
3-mercaptoethanol) with 2.5 units FnuDII for 1.5 hours 
at 37 °C. Electrophoresis on a 6 percent polyacrylamide 
15 g e i an( i standard recovery procedures were used to 

isolate a 538 bp DNA fragment containing the coding 
sequence for the last 175 amino acids of the gene 
followed by a translation stop codon. 



10 



20 



Example 1 — Plasmid for the Expression of Methionyl 
Human Growth Hormone Using the Lipoprotein Promoter of 
E. coli 

A. Construction 

A double stranded DNA fragment (110 in Fig- 
2 ^ ure 5) was synthesized by the phosphotriester method to 
join the lpp promoter region with the human growth 
hormone coding region for direct expression of human 
growth hormone* The upper strand has 66 nucleotides 
which includes on the 5' end the 4 nucleotide single 
3 ^ stranded sequence produced by Xbal cleavage. The lower 
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strand has 62 nucleotides which are complementary to 
th last 62 nucleotides of the upper strand. The first 
part of the synthetic DNA fragment follows the natural 
sequence of the lpp gene from the Xbal restriction site 
5 in the ribosome binding site through the translation 

initiating methionine codon (19 bp) and is followed by 
the sequence for the first 47 nucleotides of human 
growth hormone to the unique FnuDII site previously 
described*. 

10 The double stranded DNA fragment (110 in 

Figure 5) has the following structure: 

Xbal 

5* CTAGAGGGTATTAATAATGTTCCCAACCATTCCCTTATCC- 
3 1 TCCCATAATTATTACAAGGGTTGGTAAGGGAATAGG- 

15 AGGCTTTTTGACAACGCTATGCTCCG 3 1 • ThaI 

TCCGAAAAACTGTTGCGATACGAGGC 5 • 

The fragment was prepared by recognized phos- 
photriester methodology by which the following segments 
2Q were prepared: 



25 



30 



1) 


CTAGAGGGTAT 


2) 


TAATAATGTTCC 


3) 


CAACCATTCCC 


4) 


TTATCCAGGC 


55 


TTTTTGACAACG 


6i 


CTATGCTCCG 


7) 


CATTATTAATACCCT 


8) 


GGTTGGGAA 


9) 


GGATAAGGGAAT 


10) 


GTCAAAAAGCCT 


11) 


CGGAGCATAGCGTT 
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Using the above-prepared segments, the fol- 
lowing three duplexes were prepared. 

a) Segment 1 (5'-unphosphorylated) was ligated 
to 5 '-phosphorylated segment 2 in the presence of 

5 '-phosphorylated segment 7 using T 4 ligase to produce 
duplex 1 by following the established procedure 
[E. L. Brown, R. Belagaje, M. J. Ry an and H. G. Khorana, 
Methods in Enzymology 68, 109-151 (1979)]. The duplex 
was isolated by preparative gel electrophoresis on 
15% polyacrylamide. 

b) 5 '-Phosphorylated segment 3 was ligated 
to 5 '-phosphorylated segment 4 in the presence of 5'- 
phosphorylated segments 8 and 9 using T 4 ligase to 
produce duplex 2 which was isolated by preparative gel 
electrophoresis on 15% polyacrylamide. The reaction 
was performed as described above. 

c) 5 • -Phosphorylated segment 5 was ligated 
to 5 '-phosphorylated segment 6 in the presence of 

5 '-phosphorylated segment 10 and 5'-unphosphorylated 
segment 11 using T 4 ligase as described above to 
produce duplex 3. The duplex was isolated by prepara- 
tive gel electrophoresis on 15% polyacrylamide. 

Duplexes 1, 2, and 3 then were joined by T 
ligase to produce the double stranded DNA segment (110, 
in Figure 5) which was isolated by preparative gel 
electrophoresis on 15% polyacrylamide. This product 
was then enzymatically phosphorylated at its 5 '-ends 
using T 4 polynucleotide kinase and [Y-p 32 ]ATP by fol- 
lowing the established procedure. 
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The expression plasmid was constructed by 
enzyroatically joining 0.1 picamol (0.4 yg) plasmid 
vector (107 in Figure 5) , 3.2 picamoles synthetic DNA 
fragment (110 in Figure 5) and 0.24 picamoles (0.08 yg) 

5 of 538 bp fragment (109 in Figure 5, see Preparation) 

in 14 yl of ligation buffer using 2 units T 4 DNA ligase. 
After incubation for 16 hours at 4°C. the mixture was 
used to transform E. coli JA221 as previously described. 
Transformed colonies were selected on agar plates 

10 containing 100 yg/ml ampicillin. Plasmids from 10 
colonies were prepared by the previously described 
Birnboim screening procedure. After digestion by 
restriction enzymes Xbal and BamHI followed by acryl- 
amide- gel electrophoresis one plasmid was found to 

15 contain the expected 604 bp fragment. This plasmid was 
amplified and the DNA sequence from the Xbal site 
through the FnuDII site was determined by the_ procedure 
described in Maxam, A. M. and Gilbert, W., Proc. Natl. 
Acad . Sci USA 74 , 560-564 (1977) and found to be cor- 

20 rect. The plasmid is hereafter referred to as pNM645 
(111 in Figure 5) . 

B, Expression of human growth hormone 

Initial expression of human growth hormone by 

25 the plasmid pNM645 in E. coli JA221 was detected by 
modifications of the solid phase radioimmunoassay 
procedures described by Broome, S., and Gilbert, W., 
Proc . Natl . Acad Sci USA 75, 2746-2749 (1978), Hitzeman, 
R. A. , et al . , ICN-UCLA Symposia on Molecular and Cell- 

30 ular Biology 14, 57-68 (1979) and Erlich, H. A., et 
al., Cell 13, 681-689 (1978). 
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SDS-polyacrylaraide gel analysis of total 
bacterial cell protein performed according to Leammli, 
U. K., Nature 227 , 680-685 (1970) revealed a major 
protein band of approximately 20,000 daltons. This 
5 band is estimated to be at least 10 percent of total 

protein and is not present in preparations of E. coli 
JA221 containing pKEN021. Quantitative expression was 
measured by a standard radioimmunoassay procedure of 
Twomey, S. L., Beattie, J. M. , and Wu, G. T. , Clin Chem 
10 20, 389-391 (1974) and found to be at least 2 million 

molecules per cell. The methionyl human growth hormone 
was partially purified from 500 gm E. coli cells by 
extraction with 8M urea and 1 percent Triton X100. The 
debris was removed by centrifugation and the supernatant 
*5 containing the soluble growth hormone was fractionated 
on a Whatman DE52 column. The peak fractions as 
determined by radioimmunoassay (RIA) were pooled and 
subjected to isoelectric precipitation. . This material 
was further purified on a Whatman SE53 column. The 
peak fractions were determined by RIA and the material 
was concentrated by isoelectric precipitation or 
ultrafiltration. 

The biological activity of the recovered 
methionyl human growth hormone was determined by 
measurement of the proximal epiphyseal cartilage width 
in hypophysectomized female rats according to the 
method of Greenspan, F. S., et al. , Endocrinolocry 45 , 
455-463 (1948). Its activity was found to be consis- 
tent with that of human growth hormone obtained from 
30 cadavers. 



20 
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Example 2 — Plasmid for the Expression of Methionyl 
Bovine Growth Hormone Using the Lipoprotein Promoter of 
E. coli 

5 Plasmid pNM645 (111 in Figure 6) , the ex- 

pression plasmid for methionyl human growth hormone was 
used as the starting material for construction of a 
plasmid expressing methionyl bovine growth hormone. 

Plasmid pBP348 (112 in Figure 6) , described 

10 in Miller, W. L. , et: al. , J. Biol . Chem . 255 , 7521- 
7524 (1980) , was used as the source of two DNA frag- 
ments containing the coding sequence for a portion of 
the bovine growth hormone gene. The plasmid contains 
an 831 bp sequence coding for bovine growth hormone 

15 cloned in the PstI ( 5 '-CTGCAG3 1 ) restriction site of 

pBR322. As an alternative to the method described in 
Miller et al . , the sequence for bovine growth hormone 
can be obtained from messenger RNA isolated from bovine 
pituitaries by now routine procedures described by 

20 Goodman, H. M. , et a^L. , Methods in Enzymology 68 , 
75-90 (1979). 

The coding sequences for human growth hormone 
and bovine growth hormone are very similar and show 
much homology. Particularly useful in the construction 

25 of the expression plasmid for bovine growth hormone 
were the fragments generated by digestion with the 
restriction enzyme PvuII (S'CAGCTGS 1 ) . The size of the 
fragments produced are 497 bp in human growth hormone 
and 494 bp. in bovine growth hormone. The corresponding 

30 restriction sites occur in the same coding frames in 
both sequences. 
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Ten micrograms of pNM645 (111 in Figure 6.) 
containing 3 PvuII sites per roolecul were digested 
with 1 unit of PvuII in 200 yl of PvuII restriction 
buffer (60mM NaCl, 6mM Tris:HCl pH 7.5, 6mM MgCl 2 , 6mM 

5 fl-mercaptoethanol) for 10 minutes at 37 °C. The re- 

action was stopped by heating at 65 °C. for 10 minutes, 
and the DNA was alkaline phosphatase treated. This 
limited digestion procedure leads to the cleavage of 
one-half to two-thirds of the PvuII sites present • The 

10 fragments were separated on a one percent agarose gel 
and the DNA fragment (113 in Figure 6) of the size 
corresponding to linear plasmid with the 497 bp PvuII 
fragment missing (runs slightly faster than single cut 
plasmid) was excised, purified and used as vector in 

15 the construction of intermediate plasmid pNM685 (114 in 
Figure 6) . 

A 494 bp PvuII fragment was prepared from 
PBP348. Ten micrograms of the plasmid were digested in 
200 yl PvuII buffer with 10 units of PvuII for 1 hour 

20 at 37 °C. The fragments were separated on a 6 percent 

polyacrylamide gel and the 494 bp fragment (from 112 in 
Figure 6) was visualized and purified by methods 
described previously. 

Intermediate plasmid pNM685 (114 in Figure 6) 

25 was constructed by ligation of 0.2 yg vector with 

0.05 yg of 494 bp fragment in 20 yl of T 4 DNA ligase 
buffer containing 2 units Tj DNA ligase fdr 16 hours at 
4°C. After transformation and selection of trans- 
formants for ampicillin resistance, plasmids prepared 

30 by the previously described Birnboim procedure were 
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analyzed for the presence of the 494 bp PvuII fragment. 
Proper orientation of the fragment was determined by 
sequential digestion with enzymes Xbal and Smal. The 
494 bp PvuII fragment from the bovine growth hormone 

5 sequence has a unique asyroetric Smal restriction site. 
Parent plasmid pNM645 contains no Smal sites. A plasmid 
with a 494 bp PvuII fragment and a 416 bp Xbal, Smal 
fragment was selected as the desired intermediate and 
was used in further constructions. 

10 Plasmid pNM685 (114 in Figure 7) was con- 

verted to a. bovine growth hormone expression plasmid by 
two procedures: (1) the coding sequence of the first 
22 amino acids of human growth hormone was removed and 
replaced with the coding sequence for the first 23 

15 amino acids of bovine growth hormone and (2) a short 
sequence . between the second PvuII site in the coding 
sequence to the stop codon (which is a human growth 
hormone sequence) was replaced with a synthetic frag- 
ment to restore the codon for alanine, the 190th amino 

20 acid of bovine growth hormone. 

Ten micrograms of pNM685 were digested with 
1 unit PvuII in 200 yl PvuII buffer for 5 minutes at 
37°C. The reaction was stopped by heating at 65°C. for 
10 minutes. The mixture of fragments was spread on a 1 

25 percent agarose gel and linear plasmid having only a 

single PvuII cut per molecule was recovered and purified 
This recovered material (approximately 3 yg) was 
digested completely with 5 units of Xbal and treated 
with alkaline phosphatase. The fragments were spread 

30 on a 1 percent agarose gel and the largest fragment 
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(missing the 85 bp fragment between Xbal and the first 
PvuII site in human and bovine growth hormone) was 
recovered and used as the cloning vector (115 in 
Figure 7 ) . 

5 The DNA sequence for the first 23 amino acids 

(69 bp) of bovine growth hormone to the first PvuII 
site contains 2 restriction sites for enzyme Hpall 
(S'CCGGS'). The first site is 23 bp from the first 
nucleotide of the coding sequence. A 42 bp fragment 

10 (116 in Figure 7) corresponding to the 19 bp sequence 
from the Xbal site in the Ipp ribosome binding site 
through the ATG initiation codon followed by the 
sequence for the first 23 bp of bovine growth hormone 
was synthesized by the phosphotriester method. 

15 The fragment has the following structure: 

Xbal Hpall 
5' 

CTAGAGGGTATTAATAATGGCTTTTCCGGCTATGTCTCTGTC 
3 J TCCCATAATTATTACCGAAAAGGCCGATACAGAGACAGGC 5» 

20 In producing the 42 bp fragment , the following 

six segments were prepared: 

1) CTAGAGGGTAT 

21 TAATAATGGCTTTTC 

25 3) CGGCTATGTCTCTGTC 

4) CATTATTAATACCCT 

5i TAGCCGGAAAAGC 

6) CGGACAGAGACA 



30 
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Using the above-prepared segments, 5 1 - 
phosphorylated segment 2, 5 '-phosphorylated segment 3, 
S'-phosphorylated segment 5 and 5 '-unphosphorylated 
segment 6 were ligated using T 4 ligase to form a duplex, 

5 which was purified by 15% polyacylamide gel electro- 
phoresis. To this duplex, 5 1 -unphosphorylated segment 1 
and 5*-phosphorylated segment 4 were added in the 
presence of T 4 ligase. The resulting 42 bp DNA duplex 
(116 in Figure 7) was isolated by 15% polyacrylamide 

10 gel electrophoresis. This duplex was then enzymatically 

phosphorylated at its 5' -ends using T. polynucleotide 

32 

kinase and [y-p ] ATP following established procedures. 

The DNA fragment of 46 bp which runs from the 
above described Hpall site to the PvuII site was obtained 

15 from the original pBP348 plasraid. One hundred micro- 
grams of plasmid were digested in 400 yl of PvuII 
buffer with 50 units of PvuII for 2 hours at 37 °C. 
After phenol extraction and ethanol precipitation the 
DNA was dissolved in 400 yl of PstI (S'CTGCAGS 1 ) buffer 

20 (50mM NaCl, 6mM Tris:HCl pH 7.4, 6mM MgCl 2 , 6mM 

mercaptoethanol) with 50 units of PstI for 2* hours at 
37 °C. The DNA fragments were spread on a 6 percent 
polyacrylamide gel (30 cm long) and the 135 bp fragment 
containing the desired 46 bp sequence was recovered and 

25 purified by standard procedures. One-third of the 
recovered DNA (equivalent to 33 yg of plasmid) was 
subjected to limited digestion by Hpall restriction 
enzyme. The DNA was digested in 100 pi Hpall buffer 
(20mM TrisrHCl pH 7»4, 7mM MgCl 2 , 6mM p-mercaptoethanol) 

30 with 1 unit of Hpall for 40 minutes at 37°C. The 
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reaction was stopped by heating at 65°C. for 10 minutes. 
The DNA fragments were run on a 5 percent acryl amide 
gel ( aery laraide : bis ratio 19:1). One microgram of 
pBR322 digested with SauIIIA restriction enzyme was run 

5 in a separate well. This mixture of fragments contains 
a 46 bp fragment which is used as a size marker. The 
46 bp fragment yielded by Hpall partial digestion of 
the 135 bp fragment (from 112 in Figure 7) was purified 
by standard procedures. 

10 Two- tenths microgram pi as mid vector (115 in 

Figure 7) having Xbal and PvuII ends was combined with 
1.6 picamoles of synthetic 42 bp fragment (116 in 
Figure 7) and 0.5-1 picamoles 46 bp fragment (from 112 
in Figure 7) in 10 *il ligation buffer with 2 units 

15 of T 4 DNA ligase and ligated for 16 hours at 4°C. The 

mixture was used to transform E. coli JA221, and plasmids 
were prepared from colonies selected by ampicillin 
resistance. The plasmids were screened for the presence 
of a 494 bp PvuII fragment and an 88 bp Xbal, PvuII 

20 fragment. Eighteen of thirty-six analyzed had these 

fragments. Two of the plasmids were sequenced from the 
Xbal site through the PvuII site and tested in a radio- 
immunoassay for bovine growth hormone. One was found 
which responded positively in the radioimmunoassay and 

25 had the correct sequence. This plasmid was designated 
PNM797 (117 in Figure 7) . Quantitative expression was 
measured by standard radioimmunoassay procedures for 
bovine growth hormone and found to be at least 10 5 
molecules per cell. 
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Plasmid pNM797 (117 in Figure 8) requires one 
amino acid codon change for complete conversion to 
bovine growth hormone. This is accomplished by the 
removal of the 28 bp PvuII to BamHI fragment of pNM797 
and replacement with a synthetic double strand fragment 
(13 bp upper strand, 17 bp lower strand) having the 
following sequence and shown at 118 in Figure 8: 

5' 3 1 

CTGTGCCTTCTAG 

3 f GACACGGAAGATCCTAG 5 , 



Ten micrograms of pNM797 are digested with 1 
unit of PvuII in 200 yl PvuII buffer for 5 minutes at 
37 °C. The enzyme is inactivated by heating 10 minutes 
at 65 °C. The sample is diluted to 300 yl with the 
^ addition of BamHI buffer and digested to completion 
with 10 units of BamHI for 1 hour at 37 °C. This is 
followed by the addition of 5 units of alkaline phos- 
phatase and incubation for 1 hour at 65°C. The DNA 
fragments are separated on a 1 percent agarose gel, 
20 and a DNA fragment (119 in Figure 8) the size of single 
cut plasmid is purified. Two-tenths microgram of this 
is ligated with 5 picamoles of synthetic fragment using 
2 units of T^ ligase in 20 yl ligase buffer overnight 
at 4°C. Following transformation and the previously 
2 ^ described Birnboim plasmid isolation procedure 9 several 
plasmid s are selected which contain the appropriate 
size PvuII fragment (494 bp) and Xbal, BamHI fragment 
(604 bpl . The sequence of at least two of these is 
determined from the BamHI site toward the unique Sroal 
site and one selected with the desired sequence (120 
in Figure 81. 
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Example 3 — Variation of Plasmid of Example 1 

A tetracycline resistant variation of pNM645 
(111 in Figure 9) was constructed by replacing the lpp 
3' sequence between BamHI and Sail restriction sites 
with a DNA fragment derived from pBR322 (102 in Figure 9) . 
Tetracycline resistance in pBR322 is conferred by the 
product of a gene whose promoter is cleaved by Hindlll 
(S'AAGCTTS') . The coding region for the gene begins 
nearby and extends through the BamHI and Sail restric- 
tion sites of the plasmid. The tetracycline promoter 
was destroyed by digestion of the sequence with Hindlll 
followed by Si nuclease treatment to remove the single 
strand ends* Five micrograms of pBR322 were digested 
in 200 pi of Hindlll buffer (60mM NaCl, 20mM Tris:HCl 
pH 7.4, lOmM MgCl 2 , 6mM p-mercaptoethanol) with 10 
units Hindlll for 1 hour at 37 °C, DNA was phenol/CHClj 
extracted, ethanol precipitated and resuspended in 
300 pi SI buffer (300mM NaCl, 25mM sodium acetate pH 
4.25, ImM ZnCl 2 ) . SI nuclease was added at 1000 
20 units/ml and incubated 1 hour at 15 °C. After phenol/CHCl 3 
extraction and ethanol precipitation the DNA was re- 
suspended in 200 pi Sail restriction buffer (150mM 
NaCl, 6mM TrisrHCl pH 7.9, 6mM MgCl 2 , 6mM &-mercapto- 
ethanoll and digested with 5 units SalX for 1 hour at 
37°C. Electrophoresis on a 6 percent polyacrylamide 
gel was used to isolate the 617 bp fragment generated. 
Recovery and purification of the fragment was as described 
previously. Five micrograms of pNM645 were digested 
with 5 units of BamHI for 1 hour at 37 °C. After 
phenol/CHCl^ extraction and ethanol precipitation the 
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DNA was dissolved in TEN. Two micrograms of pNM645 
with BamHI cohesive termini were converted to blunt 
ended DNA by "filling in" using 1 unit of the large 
fragment of E. coli DNA polymerase I in 20 pi DNA 

5 polymerase I buffer (70mM Tris:HCl pH 7.6 , lOmM MgCl 2 , 
lOmM 3-mercaptoethanol , 0.5mM each dATP, dCTP, dGTP, 
TTP) for 1 hour at 15 °C. The enzyme was denatured at 
65°C. for 10 minutes and the DNA was cleaved by the 
addition of Sail buffer and 3 units of Sail restriction 

10 enzyme for 1 hour at 37 °C. The DNA was separated on a 
1 percent agarose gel and the large plasmid fragment 
was eluted from the gel after freezing. Ethidium 
bromide and agarose fragments were removed by phenol/CHCl 
extraction and ethanol precipitation. The plasmid was 

15 dissolved in 20 yl of TEN. Two-taiths microgram plasmid 
vector (0.05 picamole) was ligated with 0.2 yg 617 bp 
fragment (0.5 picamole) using previously described 
conditions. Transformed E. coli JA221 colonies were 
selected on agar plates containing 100 pg/ml ampicillin 

20 and 15 pg/ml tetracycline. Plasmids (designated pNM736, 
121 in Figure 9) were isolated and found to contain 
desired sequence by restriction enzyme analysis. 
Expression of methionyl human growth hormone was found 
to be as fiigh as that for pNM645* 

2$ Example 4.. ; t- Plasmid for the Expression of Met-Phe-Pro- 
Leu-Asp-Asp-Asp-Asp-Lys-Human Growth Hormone and Its 
Use as Substrate for Selective Cleavage by Enterokinase 
t3.4*21.9) t to Product Mature Human Growth Hormone 

^ Q A double stranded DNA fragment (122 Figure 

10) was synthesized by the phosphotriester method 
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to join the Ipp promoter region with the human growth 
hormone coding region preceeded by a start codon and 
a coding region for a short peptide which defines a 
sequence recognized and cleaved by enterokinase . The 

5 upper strand has 90 nucleotides which includes on the 
5 f end the 4 nucleotide single stranded sequence pro- 
duced by Xbal cleavage. The lower strand has 86 
nucleotides which are complementary to the last 86 
nucleotides of the upper strand. The first part of 

10 the synthetic DNA fragment follows the natural sequence 
of the Ipp gene from the Xbal restriction site in the 
ribosome binding site through the translation initiating 
methionine codon (19 bp) and is followed by the se- 
quence for the enterokinase cleavage site and the first 

15 47 nucleotides of human growth hormone to the unique 
FnuDII site previously described. 

The double stranded DNA fragment (122 in 
Figure 10} has the following structure: 

Xbal 

20 5 1 CTAGAGGGTATTAATAATGTTCCCATTGGATGATGATGATAAGTTCCCAA- 

TCCCATAATTATTACAAGGGTAACCTACTACTACTATTCAAGGGTT- 

CCATTCCCTTATCCAGGCTTTTTGACAACGCTATGCTCCG 3 1 _ FnuDIX 
GGTAAGGGAATAGGTCCGAAAAACTGTTGCGATACGAGGC 5 1 

25 The fragment was prepared by recognized phos- 

photriester methodology by which the following segments 
were prepared: 

11 CTAGAGGGTAT 
2) TAATAATGTTCC 
30 3) CATTGGATGAT 
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4) 


GATGATAAGTTCC 


5) 


CAACCATTCCC 


6) 


TTATCCAGGC 


7) 


TTTTTGACAACG 


8) 


CTATGCTCCG 


?) 


CATTATTAATACCCT 


10) 


ATGGGAA 


ID 


CTTATCATCATCCA 


12) 


GGTTGGGAA 


13) 


GGATAAGGGAAT 


14) 


GTCAAAAAGCCT 


15) 


CGGAGCATAGCGTT 



Using the above-prepared segments, the 
ligase catalyzed joining reactions were performed 
stepwise as described below: 

a) S'-Unphosphorylated segment 1 was joined 
to S'-phosphorylated segment 2 in the presence of 
5'-phosphorylated segment 9 using T 4 ligase to form 
DNA duplex 1 IE. L. Brown, R. Belagaje, M. J. Ryan 
and H. G. Khorana, Methods in Enzymology 68 r 109-151 
(1979)3 ♦ The duplex was isolated by preparative gel 
electrophoresis on 15% polyacryl amide. 

lb) 5 • -Phosphorylated segment 3 was joined 
to 5 1 -phosphorylated segment 4 in the presence of 
S'-phosphorylated segment 11 using T 4 ligase to form 
DNA duplex 2 which was purified by 15% polyacrylamide 
gel electrophoresis. 
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c) 5 * -Phosphorylated segment 5 was joined 
to 5* -phosphorylated segment 6 in the presence of 5*- 
phosphorylated segments 12 and 13 using T 4 ligase to 
form DNA duplex 3 which was purified by 15% poly- 

5 acrylamide gel electrophoresis. 

d) 5 * -Phosphorylated segment 7 was joined to 
5 ' -phosphorylated segment 8 in the presence of 5*- 
phosphorylated segment 14 and 5 ' -unphosphorylated 
segment 15 using T 4 ligase to form DNA duplex 4 which 

10 was purified by 15% polyacryl amide gel electrophoresis. 

e) The DNA duplexes 2, 3 and 4 then were 
joined together by T 4 ligase to form DNA duplex 5 
which was purified by 15% polyacrylamide gel electro- 
phoresis. 

15 f) To the DNA duplex 1 then were added 5*- 

phosphorylated segment 10 and DNA duplex 5 in the 
presence of T 4 ligase, and the resulting DNA duplex 
(110 in Figure 10) was purified by 10% polyacrylamide 
gel electrophoresis. This DNA duplex then was enzy- 

20 matically phosphorylated using T 4 polynucleotide 
kinase and Iy~P 32 ]ATP by following the established 
procedure . 

The expression plasmid was constructed by - 
enzymatically joining 0.1 picamole (0.4 ug) plasmid 

25 vector (107 in Figure 5), 0.025 picamoles synthetic DNA 
fragment (110 in Figure 5) and 0.3 picamoles (0.08 yg) 
of 538 bp fragment (109 in Figure 10 , see Preparation) 
in 24 ul of ligation buffer using 1.5 units T, DNA 
ligase. After incubation for 16 hours at 4°C. the 

30 mixture was used to transform E. coli JA221 as previously 
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described* Transformed colonies were selected on agar 
plates containing 100 yg/ml ampicillin. Plasmids from 
19 colonies were prepared by the previously described 
Birnboim screening procedure. After digestion by 

5 restriction enzymes Xbal and BamHI followed by acryl- 
amide gel electrophoresis 12 plasmids were found to 
contain the expected 628 bp fragment. 

Eight of the positive plasmids were digested 
sequentially with Xbal and PvuII and seven of these 

10 yielded a 109 bp fragment. The sequence of one plasmid 
was determined by the procedure described by Maxam, 
A.M. and Gilbert, W. , Proc . Natl . Sci . USA 74, 560-564 
(1977) and found to be correct. The plasmid was 
designated pNM702 (123 in Figure 10) . Expression of 

15 human growth hormone was detected by a standard radio- 
immunoassay procedure described by Twomey, S.L., et 
al. , Clin. Chem . 20, 389-391 (1974). Quantitative 
expression was determined to be at least 2 million 
molecules per cell. 

20 Met-phe-pro-leu (asp) 4 lys-human growth 

hormone was partially purified from 500 gm E. coli 
cells by extraction with 8M urea and 1 percent Triton 
X100. The debris was removed by centrifugation and the 
supernatant containing the soluble human growth hormone 

25 product was fractionated on a Whatman DE52 column. The 
peak fractions as determined by radioimmunoassay (RIA) 
were pooled and subjected to isoelectric precipitation. 
This material was further purified on a Whatman SE53 
column. The peak fractions were determined by RIA and 

30 the material was concentrated by isoelectric precip- 
itation or ultrafiltration. 
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The partially purified material was subjected 
to cleavage by enterokinase. Crude porcine intestine 
enterokinase (Miles Laboratories) was further purified 
by the method of Anderson , et al . , Biochemistry 16 , 

5 3354-3360 (1977) . Enterokinase was incubated with 
substrate, and samples were removed at intervals for 
examination on an isoelectric focusing gel. The 
starting material has an isoelectric point of 4.3 and 
can be seen to shift with time to a band having the 

10 isoelectric point of human growth hormone (4.91). 

Example 5 — Plasmid for the Expression of Met-Phe-Pro- 
Leu-Asp-Asp-Asp-Asp-Lys-Bovine Growth Hormone Using the 
Lipoprotein Promoter of E. coli 

15 Plasmid pNM702 (123. in Figure 11) , the ex- 

pression plasmid for human growth hormone was used as 
the starting material for construction of a plasmid 
expressing Met-Phe-Pro-Leu-Asp-Asp-Asp-Asp-Lys-bovine 
growth hormone. 

20 Plasmid pBP348 (124 in Figure 11) , described 

in Miller, W. L., et al . , J. Biol . Chem . 255 , 7521- 
7524 (1980), was used as the source of two DNA frag- 
ments containing the coding sequence for a portion of 
the bovine growth hormone gene. The plasmid contains 

25 an 831 bp sequence coding for bovine growth hormone 
cloned in the PstI (S'CTGCAGS 1 ). restriction site of 
PBR322. As an alternative to the method described in 
Miller et al . , the sequence for bovine growth hormone 
can be obtained from messenger RNA isolated from bovine 

30 P ituitarie s by now routine procedures described by 
Goodman, H. M. , et al. , Methods in Bnzymology 68 , 
75-90 (1979). 
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As noted above, the coding sequences for 
human growth hormon and bovine growth hormone are very 
similar and show much homology. The fragments, generated 
by digestion with the restriction enzyme PvuII (5'CAGCTG3') 
5 were also useful in the construction of this expression 
plasmid for bovine growth hormone. 

Ten micrograms of pNM702 (123 in Figure 11) 
containing 3 PvuII sites per molecule are digested with 
1 unit of PvuII in 200 ]il of PvuII restriction buffer 
10 (60mM NaCIV 6mM Tris:HCl pH 7.5, 6mM MgClj, 6mM 3- 

mercaptoethanol} for 10 minutes at 37 °C. The reaction 
is stopped by heating at 65 °C. for 10 minutes, and the 
DNA was alkaline phosphatase treated. This limited 
digestion procedure leads to the cleavage of one-half 
15 to two-thirds of the PvuII sites present. The frag- 
ments are separated on a one percent agarose gel and 
the DNA fragment (125 in Figure 11) of the size cor- 
responding to linear plasmid with the 497 bp PvuII 
fragment missing (runs slightly faster than single cut 
20 plasmid) was excised, purified and used as vector in 
the construction of intermediate plasmid (126 in 
Figure 111 • 

A 494 bp PvuII fragment was prepared from 
pBP348. Ten micrograms of the plasmid were digested in 
25 200 pi PvuII buffer with 10 units of PvuII for 1 hour 
at 37°C. The fragments were separated on a 6 percent 
polyacrylamide gel and the 494 bp fragment (from 124 
in Figure 11) was visualized and purified by methods 
described previously. 
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Intermediate plasmid (126 in Figure 11) is 
constructed by ligation of 0.2 yg vector with 6.05 yg 
of 494 bp fragment in 20 yl of DNA ligase buffer 
containing 2 units T 4 DNA ligase for 16 hours at 4°C. 

5 After transformation and selection of transformants for 
ampicillin resistance, plasmids prepared by the pre- 
viously described Birnboim procedure are analyzed for 
the presence of the 494 bp PvuII fragment. Proper 
orientation of the fragment is determined by sequential 

10 digestion with enzymes Xbal and Smal. The 494 bp PvuII 
fragment from the bovine growth hormone sequence has a 
unique asymetric Smal restriction site. Parent plasmid 
pNM702 contains no Smal sites. A plasmid with a 494- bp 
PvuII fragment and a 440 bp Xbal, Smal fragment is 

15 selected as the desired intermediate and is used in 
further constructions. 

Intermediate plasmid (126 in Figure 12) is 
converted to the desired fused bovine growth hormone 
expression plasmid by two procedures: (1) the coding 

20 sequence of the first 30 amino acids of enterokinase 

substrate-human growth hormone was removed and replaced 
with the coding sequence for the first 31 amino acids .. 
of enterokinase substrate-bovine growth hormone and 
(21 a short sequence between the second PvuII site in 

25 the coding sequence to the stop codon (which is a human 
growth hormone sequence) is replaced with a synthetic 
fragment to restore the codon for alanine, the 190th 
amino acid of bovine growth hormone. 
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Ten micrograms of the intermediate plasmid 
(126 in Figure 12) are digested with 1 unit PvuII in 
200 yl PvuII buffer for 5 minutes at 37 °C. The re- 
action is stopped by heating at 65 °C. for 10 minutes. 

5 The mixture of fragments is spread on a 1 percent 
agarose gel and linear plasmid having only a single 
PvuII cut per molecule is recovered and purified. This 
recovered material (approximately 3 pg) is digested 
completely with 5 units of Xbal and treated with 

10 alkaline phosphatase. The fragments are spread on a 1 
percent agarose gel and the largest fragment (missing 
the 109 bp fragment between Xbal and the first PvuII 
site in human and bovine growth hormone) is recovered 
and used as the cloning vector (127 in Figure 12) . 

15 The DNA sequence for the first 23 amino acids 

(69 bp) of bovine growth hormone to the first PvuII 
site contains 2 restriction sites for enzyme Hpall 
(S'CCGGS'l. The first site is 23 bp from the first 
nucleotide of the coding sequence. A 63 bp fragment 

20 (128 in Figure 12) was synthesized by the phospho- 
triester method. This fragment corresponds to the 
19 bp sequence from the Xbal site in the lpp ribosome 
binding site through the ATG initiation codon followed 
by the coding sequence for Phe-Pro-Leu-Asp-Asp-Asp-Asp- 

25 I*ys (24 bp) and 20 nucleotides of the coding sequence 
of bovine growth hormone from Phe to the first Hpall 
site. 
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The fragment has the following structure: 

Xbal 

5 ' CTAGAGGGTATTAATAATGTTCCCATTGGATGATGATGATAAG- 
3 • TCCCATAATTATTACAAGGGTAACCTACTACTACTATTC- 
5 Hpall • 

TTCCCAGCCATGTCCTTGTC 3* 
AAGGGTCGGTACAGGAACAGGC 5 ' 

In producing the 63 bp fragment, the following 
nine segments were prepared: 



10 



15 



1) 


CTAGAGGGTAT 


2) 


TAATAATGTTCC 


3) 


CATTGGATGAT 


4) 


GATGATAAGTTCC 


5) 


CAGCCATGTCCTTGTC 


6) 


ATGGGAACATTATTAATACCCT 


7) 


TTATCATCATCATCCA 


8) 


ATGGCTGGGAAC 


9) 


CGGACAAGGAC 



Using the above-prepared segments , the T 4 
20 ligase catalyzed joining reactions were performed 
stepwise as described below: 

al 5 • -Unphosphorylated segment 1 was joined 
to S'-phosphorylated segment 2 in the presence of 5 1 - 
phosphorylated segment 6 using T 4 ligase to form DNA 
25 duplex 1 which was purified by 15% poly aery 1 amide gel 
electrophoresis . 

bl 5 1 -Phosphorylated segments 3, 4 and 5 were 
joined in the presence of 5 1 -phosphorylated segments 7 
and 8 and 5 '-unphosphorylated segment 9 using T 4 ligase 
30 to form DNA duplex 2 which was purified by 15% poly- 
acrylamide gel electrophoresis. 
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c) Duplexes 1 and 2 then were joined by T 4 

ligase to form DNA duplex (128 in Figure 12) which was 

purified by 15% poly aery 1 amide gel electrophoresis . 

This DNA duplex then was enzymatically phosphorylated 

32 

5 using T 4 polynucleotide kinase and Iy~P ]ATP following 
established procedure. 

The DNA fragment of 46 bp which runs from the 
above described Hpall site to the PvuII site was obtained 
from the original pBP348 plasmid. One hundred raicro^ 

10 grams of plasmid were digested in 400 yl of PvuII 
buffer with 50 units of PvuII for 2 hours at 37 °C. 
After phenol extraction and ethanol precipitation the 
DNA was dissolved in 400 yl of PstI (S'CTGCAGS 1 ) buffer 
(50mM NaCl, 6mM Tris:HCl pH 7.4, 6mM MgClj, 6mM &- 

!5 mercaptoethanol) with 50 units of PstI for 2 hours at 
37 °C. The DNA fragments were spread on a 6 percent 
polyacrylamide gel (30 cm long) and the 135 bp fragment 
containing the desired 46 bp sequence was recovered and 
purified by standard procedures. One-third of the 

20 recovered DNA (equivalent to 33 yg of plasmid) was 
subjected to limited digestion by Hpall restriction 
enzyme. The DNA was digested in 100 yl Hpall buffer 
(20roM Tris:HCl pH 7.4, 7mM MgCl 2 , 6mM 0-mercaptoethanol) 
with 1 unit of Hpall for 40 minutes at 37°C. The 

25 reaction was stopped by heating at 65°C. for 10 minutes. 
The DNA fragments were run on a 5 percent acrylamide 
gel (acrylamide: bis ratio 19:1). One microgram of 
pBR322 digested with SauIIIA restriction enzyme was run 
in a separate well. This mixture of fragments contains 

30 a 46 bp fragment which is used as a size marker. The 
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46 bp fragment yielded by Hpall partial digestion of 
the 135 bp fragment (from 124 in Figure 12) was purified 
by standard procedures. 

Two-tenths microgram plasmid vector (127 

5 in Figure 12) having Xbal and PvuII ends was combined 
with 3.2 picamoles of synthetic 63 bp fragment (128 
in Figure 12) and 0.5-1 picamoles 46 bp fragment (from 
124 in Figure 12) in 10 pi ligation buffer with 2 units 
of T 4 DNA ligase and ligated for 16 hours at 4°C. The 

10 mixture was used to transform E. coli JA221, and plasmids 
were prepared from colonies selected by ampicillin 
resistance. The plasmids were screened for the presence 
of a 494 bp PvuII fragment and a 109 bp Xbal, PvuII 
fragment. One of twelve analyzed had these fragments. 

15 This plasmid was sequenced from the Xbal site through 
the PvuII site and tested in a radioimmunoassay for 
bovine growth hormone. It was found to respond pos- 
itively in the radioimmunoassay and had the correct 
sequence. This plasmid was designated pNM789 (129 

20 in Figure 12) . Quantitative expression was measured by 
standard radioimmunoassay procedures for bovine growth 
hormone and found to be at least 10 5 molecules per 
cell. 

Plasmid pNM789 (129 in Figure 13) requires 
25 one amino acid codon change for complete conversion to 
bovine growth hormone. This is accomplished by the 
removal of the 28 bp PvuII to BamHI fragment of pNM789 
and replacement with a synthetic double strand fragment 
(13 bp upper strand, 17 bp lower strand) having the 
30 following sequence and shown at 130 in Figure 13: 
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5 ' CTGTGCCTTCTAG 3 ' 

3 , GACACGGAAGATCCTAGg t 

Ten micrograms of pNM789 are digested with 1 

5 unit of PvuII in 200 yl PvuII buffer for 5 minutes at 
37°C, The enzyme is inactivated by heating 10 minutes 
at 65°C. The sample is diluted to 300 yl with the 
addition of BamHI buffer and digested to completion 
with 10 units of BamHI for 1 hour at 37°C. This is 

10 followed by the addition of 5 units of alkaline phos- 
phatase^and incubation for 1 hour at 65 °C. The DNA 
fragments are separated on a 1 percent agarose gel r 
and a DNA fragment (131 in Figure 13) the size of single 
cut plasmid is purified* Two-tenths microgram of this 

15 is ligated with 5 picamoles of synthetic fragment using 
2 units of T 4 ligase in 20 yl ligase buffer overnight 
at 4°C. Following transformation and the previously 
described Birnboim plasmid isolation procedure , several 
plasmids are selected which contain the appropriate 

20 size PvuII fragment (494 bp) and Xbal, BamHI fragment 
(628 bp)* The sequence of at least two of these is 
determined from the BamHI site toward the unique Smal 
site and one selected with the desired sequence (132 
in Figure 13) . 

25 n 
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CLAIMS 

1. A recombinant DNA cloning vector useful 
for expressing exogenous protein, which comprises 

(a) a DNA segment containing a functional 
5 origin of replication; 

(b) one or more DNA segments, each of which 
conveys to a transformable host cell a 
property useful for selection when said 
vector is transformed into said host 

10 cell; and 

(c) a DNA segment comprising a sequence that 
defines in tandem, 

(1) the promoter of a lipoprotein ex- 
pression control sequence, 
15 * 2) the 5 ' untranslated region of a 

lipoprotein expression control 
sequence and ... 
(3) a translation start codon followed, 
without interposition of a portion 
20 or a11 of a nucleotide sequence- 

coding for endogenous protein, 

(i) by a sequence coding for an 
exogenous protein, or 

(ii) by a nucleotide sequence 
25 coding for an enterokinase 

cleavage site to which is 
joined, without interruption, 
a nucleotide sequence coding 
for an exogenous protein. 

30 
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2. The vector of claim 1 wherein the 
exogenous prot in nucleotide s quence codes for human 
growth hormone or bovine growth hormone, 

3 • The vector of claim 1 or 2 wherein the 
5 nucleotide sequence of the promoter and that of the 
5* untranslated region are derived from gram-negative 
bacteria * 

4. The vector of claim 3 wherein the 
nucleotide sequence of the promoter and that of the 5' 

10 untranslated region are derived from the same gram- 
negative bacteria* 

5. The vector of claim 4 wherein the 
nucleotide sequence of the promoter and that of the 
5 1 untranslated region are derived from E. coli * 

15 6. The vector of any of claims 1 to 5 which 

contains in whole or in part the 3 • untranslated region 
of a lipoprotein expression control sequence, said 3 1 
untranslated region being located downstream of the 
sequence coding for the exogenous protein. 

2o 7. The vector of any of claims 1 to 6 which 

contains in whole or in part the transcription termination 
region of a lipoprotein expression control sequence, 
said transcription termination region being located 
downstream of the sequence coding for the exogenous 

25 protein. 

8. The vector of claim 7 which contains in 
whole or in part the transcription termination region 
of a lipoprotein expression control sequence., said 
transcription termination region being located down- 
30 stream of the 3' untranslated region. 
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9. The vector of any of claims 1 to 8 
wherein the enterokinase cleavage site codes for a 
sequence of amino acids comprising Asp-Asp-Asp-Asp-Lys, 

10. The vector of claim 9 wherein the DNA 
5 sequence coding for the enterokinase cleavage site 

cn i GATGATGATGATAAG 
P CTACTACTACTATTC * 

11. The vector of claim 9 or 10 wherein 
the enterokinase cleavage site codes for Phe-Pro-Leu- 

10 Asp-Asp-Asp-Asp-Lys . 

12. The vector of claim 11 wherein the DNA 

sequence coding for the enterokinase cleavage site is 

TTCCCATTGGATGATGATGATAAG 
AAGGGTAACCTACTACTACTATTC * 

15 13. Plasmid pNM645. 

14. Plasmid pNM797. 

15. Plasmid pNM736. 

16. Plasmid pNM702. 

17. Plasmid pNM789. 

20 18. DNA sequence of the formula 

5* CTAGAGGGTATTAATAATGTTCCCAACCATTCCCTTATCC- 
3 " TCCCATAATTATTACAAGGGTTGGTAAGGGAATAGG- 

AGGCTTTTTGACAACGCTATGCTCCG 3 • 
TCCGAAAAACTGTTGCGATACGAGGC 5' 



25 



19. DNA sequence of the formula 

5 CTAGAGGGTATTAATAATGGCTTTTCCGGCTATGTCTCTGTC 3 * 
3 1 TCCCATAATTATTACCGAAAAGGCCGATACAGAGACAGGC 5 1 



30 
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20. DNA sequence of the formula 



5 » CTAGAGGGTATTAATAATGTTCCCATTGGATGATGATGATAAGTTCCCAA- 
3 1 TCCCATAATTATTACAAGGGTAACCTACTACTACTATTCAAGGGTT- 

CCATTCCCTTATCCAGGCTTTTTGACAACGCTATGCTCCG 3 9 
GGTAAGGGAATAGGTCCGAAAAACTGTTGCGATACGAGGC 5 1 

21. DNA sequence of the formula 

CTAGAGGGTATTAATAATGTTCCCATTGGATGATGATGATAAG- 
TCCCATAATTATTACAAGGGTAACCTACTACTACTATTC- 

TTCCCAGCCATGTCCTTGTC 3' 
AAGGGTCGGTACAGGAACAGGC 5 1 

22. DNA sequence of the formula 

GATGATGATGATAAG 
CTACTACTACTATTC . 

23. DNA sequence of the formula 

TTCCCATTGGATGATGATGATAAG 
AAGGGTAACCTACTACTACTATTC . 
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